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"VOTRAX is a commercially available voice synthesizer 
for use with a digital computer. This thesis describes the 
design and implementation of a VOTRAX terminal for use with 
the Fairchild P24 computer. 

Chapters of the thesis consider the audio response 
technology, some characteristics of Phonetic English Speech, 
configuration of hardware, and describe the PHONO computer 
program which was developed. The last chapter discusses the 
advantages of the VOTRAX voice synthesizer and proposes a 
future version of the system with a time-sharing host 
computer. 
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CHAPTER I 


INTRODUCTION 

Audio response is in many instances an ideal component 
in man-machine communication (1). This approach can he used 
in combination with other means of presentation to provide 
an oral explanation of text (2) or music* to keep up the 
students* interest. 

The attractive features of the audio response in man- 
machine communication include the following* 

1. Audio response is a natural way for humans to 
receive communications from others. 

2. Audio messages can be received without inter¬ 
rupting the use of the manual or visual channels, in 
motion, or in total darkness. 

3. Any nmber of listeners can receive the same 
audio message simultaneously. 

There are some eighteen American suppliers (3) of voice 
output equipment. The VOTRAX** Voice Synthesizer (VOTRAX) 

*The computer aided instruction music learning system 
currently under development in the Department of Computer 
Sciences, North Texas State University, Denton, Texas. 

**Trade"mark of Vocal Interface Division of the Federal 
Screw Works, Troy, Michigan. 
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appeared to be ideal for Computer Aided Instruction (CAI) 
Systems because it is relatively inexpensive and digitally 
controllable. VOTRAX has an essentially unlimited English 
vocabularyi in contrast to other devices, which usually 
select only prerecorded words. The basis for the vocabulary 
of VOTRAX is that it accepts sequence of digital phoneme 
commands (a phoneme is a basic unit of speech) and translates 
these commands into corresponding phonetic audio message. 

Algorithms have been developed (l) which convert English 
text directly into the corresponding phonemes to drive the 
synthesizer. By using an optical reading device, printed 
text may be scanned and automatically converted into speech. 
It is speculated that this system could then be used as an 
aid to the blind (2). 

Purpose of the Study 

This thesis describes an effort to program the VOTRAX 
interactively under control of a Fairchild F24 mini-computer. 
The PHONO program, written in F24 Assembler Language, pro¬ 
vides for user English instruction commands of the VOTRAX. 

The primary design problem was to allow users to synthesize 
speech and to develop new vocabularies which can be stored in 
the vocabulary table of the computer and used upon demand. 
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CHAPTER II 


PHONETIC ENGLISH SPEECH 

The material for this chapter was derived from 
Reference 1, for the convenience of the reader’s under¬ 
standing and reference. 

Table I lists all the VOTRAX phonemes (1) and their 
probable usage. 


TABLE I 

VOTRAX PHONEMES AND USAGE 



Phoneme Usage 
Name 

Phoneme Usage 
Name 

Phoneme Usage 

Name 


A, AY 

eight 

A1 

eighty 

AE 

cat 

AEl 

antenna 

AH 

car 

AH, lY 

nine 

AHl 

connect 

AW 

law 

AWl 

fault 

B 

bed 

T. CH 

catch 

D 

do 

D. J 

edge 

EH 

ten 

EH, EHH 

Ed 

EHl, EH2 

seven 

ER 

weather 

F 

for 

G 

get 

H 

hello 

I 

six 

I, IH 

sin 

11 

inept 

12 

static 

IE 

zero 

lEr E 

three 

IH 

station 

lU, U 

two 

K 

came 

L 

hello 

M 

mile 

N 

nine 

NG 

ring 

0 

no 

01 

hello 

02 

notice 

00 

book 

00, OOH 

bush 

001 

good 

p 

penny 

Q 

quick 

R 

three 

s 

six 

SH 

show 

T 

ten 

TH 

three 

THV 

then 

U1 

thorough 

stirrup 

UH 

but 

UHl 

uncle 

UH2 

UH3 

apple 

V 

seven 

W 

won 

Y 

2H 

sixty 

azure 

Y1 

yes 

Z 

zero 
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Vowel sounds which are considered by the unsophisticated 
to be basic sounds can be in reality glides of vowels in 
combination, such as long A, short A, long E, short E, 
long I, short I, long 0, and long U. Other sounds are also 
combination of two or more basic sounds, such as combination 
of T and CH in "CATCH"* The other numbered phonemes are 
lesser stressed versions of those listed in Table I, 

It is important to know the many glides found in English 
and how to generate them with the VOTRAX equivalent. 

Table II shows some examples. 


TABLE II 

GLIDES IN ENGLISH AND VOTRAX PHONEME EQUIVALENTS 


Common Phoneme Name 

VOTRAX Phoneme Equivalent 

long A in name 

A AY 

short A in hand 

AE EHH 

AW in raw 

UH AW 

long E in three 

IE E or IE Y 

short E in Ed 

EH EHH 

long I in high 

AH lY 

short I in is 

I IH 

long 0 in low 

UH 0 

OW in cow, OU in out 

AH 0 

01 in noise 

0 12 Y1 

short 0 in hop 

UH AH 

long U in tune 

lU U 

double 0 in took 

OOH 00 

double 0 sound in bush 

00 OOH 

J, DG in edge 

D J 

G in George 

D J 

CH in chair 

T CH 

NK in thank 

NG K 

NG in engage 

N G 

NG in engine 

N D J 

beginning Y like in yes 

Y1 
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V/ith all vowels, appropriate stress must be determined 
and the corresponding phoneme chosen. The word "eight", for 
example, is spelled A AY T while the word "eighty" is spelled 
A1 AY TY. The A1 in place of A shortens the syllable of 
"eighty" to correspond with our natural pronunciation of that 
word. As a general rule, longer words have short syllables 
and require more higher numbered, shorter vowel commands. 

The more elaborate synthetic English speech rules (1) 
you follow, the more intelligible speech you get. 
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CHAPTER III 


HARDWARE CONFIGURATION 

The hardware configuration of this VOTRAX Voice 
Synthesizer terminal is shown in Figure 1. 



Fig, 1—Hardware configuration. 


The VOTRAX is connected to a Fairchild F24 computer 
which has 4196 words of 24 bits. The interface was 
designed (1) as a North Texas State University Computer 
Sciences course project. 
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General Description of Fairchild F2k Computer 

The Fairchild F24 mini-computer used on this project is 
a high-speed general purpose computer. 

Donovan (2) has stated general rules for describing a 
computer; listed below are the characteristics of Fairchild 
F24 computer, following Donovan's outline (3). 

Memory 

The basic unit is a word of 24 bits (or bytes) of 
information. The unit size of core memory is 4k words and 
is expandable in modules of 4k up to l6k words. All l6k 
words are directly addressable. The complete memory cycle 
requires 1.6 microseconds. Memory address zero is reserved 
for the interrupt return address storage. Memory locations 
1 through 63 are reserved for storage of indirect addresses 
for the external interrupt system. 

Registers 

The Fairchild F24 has 1 accumulator and 1 accumulator 
extension register, each consisting of 24 bits, and 8 index 
registers consisting of l4 bits each. 

Data 

All data and instructions are stored as sequences of 
binary ones and zeros. 



io 


Instructions 

The Fairchild F24 computer has arithmetic, logical, 
control, and special interrupt instructions. Typical 
instruction formats are as shown in Figures 2 and 3. 


OP code 

_— 

Index 

I 

Operand address 


23 17 r5 14 13 U 


Operand address Bits O-13 

Indirect address control Bit l4 

Index address Bits 15-1? 

Operation code Bits 18-23 

Fig. 2—Typical F24 memory reference instruction format. 




A 

R 

Command 

Unit address | 


!§■ 171^ 15 7 ' ~'o ' 


Unit address Bits 0-7 

Command Bits 8-15 

Transfer direction control Bit I6 

Accumulator transfer control Bit 17 

Operation code Bits I8-23 


Fig. 3—P24 SPU (Select Peripheral Unit) instruction 
format. 


Special Features 

The F24 has I6 external interrupt channels with up to 
63 distinct interrupt locations. There are also 8 
programmable state-switch bits and 6 console switches, in 
addition to a large number of switches used for initiali¬ 
zation and diagnosis of the CPU. 
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General Description of the VOTRAX 
The VOTRAX (Model #5-2219-001-00) can be described as 
an electronic simulator of the human vocal system, its 
audio output can be transmitted over speaker systems or 
telephone lines. The manufacturer states that by sending an 
appropriate string of phonemes, any word in the English 
language can be uttered, in practice, however, it was found 
that the quality of speech production of the model used is 
not adequate for many purposes. 

The characteristics of a phoneme are selected by the 
value of a parallel 8-bit binary word. Two of the 8-bits 
select one of four inflection levels, the other 6 bits 
select one of 64 phonemes. 

Under digital computer control, the vocabulary or a 
string of phonemes can be stored in computer memory or 
secondary storage and then transmitted to the VOTRAX for 
pronunciation. 

l/o Characteristics of the VOTRAX 

The VOTRAX input and output logic signals are TTL com- 
patiable. Signal description (4) is shown in Figure 4. 

The logic signal, Input Set, enables the VOTRAX to 
receive data upon the initiation of the data strobe. Each 
strobe enters one phoneme and its inflection into an 80 - 
character shift register buffer. Input Set is complemented 
by an Output Set pulse. When in the output mode, no input 
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data is accepted. Inversely, no output can be performed 
while in the input mode. 


Pin no. Function 


5 BitO data in - 

9 Bitl data in - 

7 Bit2 data in - 

1 Bit3 data in - 

15 Bit4 data in -- 

3 Bits data in - 

11 Inflection LSB in 

23 Inflection MSB in 

10 Input set - 

l4 Output set - 

24 Data strobe - 

2 Normal set -—^—- 

6 Repeat set-- 

13 Busy - 

22 Audio line •* - 


Fig. 4 —Parallel TTL l/o interface. 


VOTRAX 


H 

O 

u 

-p 


O ‘m 
O 


(§ 


H + 


-P 
O O 
•H rt 
C U 
O Eh 
U 

•P H 
o ci 
0 ) o 

iH O 
W > 


Power 

Supply 


115 VAC 
60 Hz 


Normal Set is the regular application of VOTRAX. 

Repeat Set is for repeating a message. 

For maximum input (80 bytes) of phoneme data, a pause 
phoneme (PAO) must be entered as the first data. When a 
phoneme byte other than null (unsounded, 0 time code) 
reaches the end of the VOTRAX buffer, the VOTRAX speaks out 
the first phoneme even though it is still in the input mode. 
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When in the output mode, the busy signal is set and 
VOTRAX speaks out the phoneme command in sequential order. 

The rate of talking can be adjusted manually through a speech 
rate controller on the front panel of the VOTRAX. 

When VOTRAX is in the output mode and the buffer is 
empty, the busy indicator automatically is reset after 768 
milliseconds. The VOTRAX continues to output nulls as long 
as it is in the output mode. 

Interface VOTRAX with Fairchild F24 computer 

Execution of the Select Peripheral Unit (SPU) 
instruction (5) of the P24-Pairchild computer presents the 
least significant 18 bits of the SPU instruction word to the 
system peripherals on the accumulator bus during the time 
the peripheral select (PS) synchronization signal is ener¬ 
gized. At Tj^ of SPU, the least significant 18 bits of the 
command register are gated to the accumulator bus. Each 
peripheral device receiving the PS signal decodes the least 
significant 8 bits of the accumulator bus to determine whether 
or not these bits contains its own unit code. If the SPU 
instruction is of the form requiring an input or output 
accumulator transfer, the data on the accumulator bus at 
T^ time of the SPU is loaded into the accumulator, or the 
contents of the accumulator is gated onto the accimiulator bus 
at T^, depending on the values of bits 1? and l6 of the 
command. If bit 17 is set in the SPU command there is an 
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accumulator transfer made at of the SPU. If bit l6 is 
not set, the transfer is from the accumulator to the addressed 
peripheral via the accumulator bus. 

From the logic design drawing (see Appendix B) of the 
interface device and the SPU instruction format as shown 

in Figure 3* the SPU instruction bit pattern for the VOTRAX 
is as follows I 

1. The device code of VOTRAX (3718) is decoded from 
the SPU command, bits 0-?, at time. 

2. Bit 10 of the SPU must be "I” to enable 
Input/Output. Bit 12 of the SPU must be "1" to enable 
Normal/Repeat. 

3. Since a "low" voltage to the Input, Output, 

Normal, and Repeat sets of VOTRAX means "true"; 
therefore, if we want the VOTRAX to be in the Output- 
Repeat mode, bit 9 and bit 11 must all be "0", 

4. The data from accumulator bus is gated at 
time for strobe-in. 

5« The two D flip-flops serve as a two-bit shift 
register gating the device code to the data latching 
at T^ time. 

6. The strobe-in as shown in Appendix B is "0" 
most of the time, but when it is the proper device code 
and bit 8 is "1” then there will be a pulse sent out 
and the latched data transfered into the VOTRAX buffer. 
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CHAPTER IV 


SOFTWARE CONFIGURATION 

The main programming effort was to convert words and 
sentences to phonetic images in a form acceptable by VOTRAX. 
The phonetic string is then transmitted from the computer 
memory to the VOTRAX buffer for speech generation# 

The main system flow is shown in Figure 5* 


Source 

The input to the PHONO program (PHONO) is called the 
source. The source can be entered through Teletype, card 
reader, or paper tape reader. 

The source is a string of characters. PHONO recognizes 
certain characters and certain groups of characters as 
different types of atoms. A syntactic element of the PHONO 
recognized by its specific class is called an atom. The 
atoms recognized are inflection levels, phonemes, words, and 
delimiters. 


Inflection Levels 

Four types of inflection levels are defined for the 
PHONO. These are* 

1 " Lowest (shorter and softer amplitude) 

2 - Low (short and soft amplitude) 

3 - High (long and loud amplitude) 

4 - Highest (longer and louder amplitude) 
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Pig. 5—Main system flow. 
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An inflection level is represented as an integer, having 
the range 1-4, followed by a phoneme. 

Phonemes 

Sixty-three different phonemes are defined for the PHONO 
(see Appendix A). A phoneme has to be preceded by an in- 
flection level or a slash •'/•■. in the case of a slash, the 
phoneme assumes the default inflection, which is the in¬ 
flection level of the previous phoneme. 


Words 

A word for the PHONO has to be defined to be spoken. 

The phonemes of a word have to be in the PHONO vocabulary 

table. The construction of the vocabulary table is discussed 
later in the chapter. 


Delimiters 

The delimiters are classified as commands and break 

characters. A command is a class of delimiters which is 

preceded by a dollar sign, The commands ares 

$ Repeat speak VOTRAX code, erase the code list. 
$S Speak once from the code list. 

$R Repeat speak, using the code list. 

$Q Quit speech, 

$EC Erase the VOTRAX code list. 

$ES Erase the source list. 

$D Go to program Memory Dump. 

$L Go to program Loader. 
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The break characters serve the general ftmction of 
separating phonemes and vocabularies from other phonemes and 
vocabularies. The break characters are* 

b b represents the class of spaces—a space, 
a comma, a period, a bar, a hyphen, or any 
number of combinations of them, 

/ A slash indicates the beginning of a phoneme. 

Vocabulary 

The result of defining the words to be used is called 
the vocabulary. The vocabulary consists of a collection (or 
set) of phrases and their associated words, A phrase consists 
of one or more words (keys) in a sequence. Examplei 'I AM THE 
PRESIDENT', The word identifier, which uniquely identifies 
a word, is referred to as the key of the word, A word con¬ 
sists of zero or more keys and/or zero or more basic consti¬ 
tuencies called atoms (or phonemes) in sequence. The vo¬ 
cabulary table is an ordered (sequential) list of elements. 

The information of a phrase definition consists of a set of 
atoms. The classes of atoms arei 

1. Word increments 

2. Keys 

3. Terminators 

4. Indices 
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Word Increments 

Byte zero of a vocabulary table elements has a number, 
called word increment, which specifies the number of con¬ 
secutive computer words for a specific vocabulary in the 
table. A typical word increment is as shown in Figure 6. 

The definition of the vocabulary "ANSWER", for example, 
takes 4 computer words; where "4" is the word increment for 
the vocabulary "ANSWER". The increment word serves as a 
secondary key (l) when searching a vocabulary in the vo¬ 
cabulary table; it identifies all those vocabularies which 
have a certain n-umber of characters. Only when the word 
increments match is an actual vocabulary search performed. 
This technique eliminates the exhaustive comparison of 
character by character in the process of searching a vo¬ 
cabulary. 


byte 0 


K 

A 

4 

E 

W 

S 

3778 

3778 

R 

index 


Fig. 6—Vocabulary table element. 
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Keys 

Following the word increment in a vocabulary table 
element is a string of ASCII characters called a key. The 
A-k-S-W-E-R as shown in Figure 6, for example, is the key. 

Terminators 

When a computer word is not fully occupied by the 
vocabulary, the rest of the computer word is filled with all 
"l*'s, called terminators, as shown in Figure 6. The termi¬ 
nator is represented by an easily recognizable value, ”3773"i 
which can not be mistaken for an ASCII character. 

Indices 

The last computer word of a vocabulary table element 
is an index into an index table. The index table has a list 
of the byte offsets into the source list (a set of the vo¬ 
cabulary phoneme code definitions) as shown in Figure 7* 

Translation Algorithm 

The translation algorithm of the PHONO program is shown 
in the main system flow chart of Figure 5- 

Data Bases 

Phoneme table .—Each entry has two computer words. The 
first word contains the ASCII phoneme name left justified. 
The second word is the corresponding VOTRAX phoneme code. 
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Vocabulary table «—A list of all the defined vo¬ 
cabularies. 

Source list .—A set of the phonetic codes definitions 
for the vocabularies. 

Code list .—A list of the generated phoneme code ready 
to be sent to the VOTRAX. 


Vocabulary 

Table 


Index 

Table 


Source 

List 



byte 

2 

byte 

1 

byte 

0 

* . 

byte 

3 

2/n 

2/AE 

■ 

■ 

i/er 

l/s j 


Fig. 7—The linked relation between a key and the 
corresponding phonetic codes or keys definition. 
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Phoneme Codes Generation 

Scanning determines the text elements. If an inflection 
level is detected, the inflection prefix is set to the 
corresponding value. When a phoneme is detected, it is used 
as a search argument into the phoneme table. If the search 
is successful, the associated octal code is OR'd to an in¬ 
flection prefix for the derived sound; and then stored in the 
output phoneme code list first in first out (FIFO). 

When a "word" is detected, it is used as a search 
argument into the vocabulary table. If the search is 
successful, the corresponding phonetic codes definition is 
retrieved from the source list and moved to the phoneme code 
list. If a PHONO command is detected, it is tested to see 
if it is a VOTRAX command. If it is a VOTRAX command, the 
output phoneme code list is transfered to the VOTRAX buffer 
for speech generation. 
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CHAPTER V 


THE OPERATION OF THE PHONO PROGRAM 
Loading 

The PHONO object program, which is in the form of 
punched-card output of the P24 Assembler (1) run on the 
IBM/ 360-50 in the University Computer Center, can be loaded 
into the F24 memory from the card reader. 

The procedure is as follows 1 

1. Turn on the CPU and card reader power and set 
them in the STOP state. 

2. Place the one-card Bootstrap Loader (2) in the 
card reader hopper. 

3 . Turn card reader VALIDITY off, 

4 . Depress card reader RESET and START. 

5 . Press CPU RESET and LOAD. 

6. Set conunand register switches to OlOOOlOOo. 

0 

7 . Press RESET and LDCR. 

8. Press START up and down. 

9- Switch the Teletype power switch to ONLINE 
(counter clockwise). 

10, Place the Memory Dump Program ( 3 ) in the Teletype 
paper tape reader. 

11. Switch the Teletype paper tape reader to RUN. 


26 



2? 


The Memory Dump Program will be read into core. 

12. Place the Card Loader Program (4) in the Teletype 
paper tape reader, 

13• Switch the Teletype paper tape reader to RUN. 

The Card Loader Program will be read into core. 

14. STOP the CPU. 

15. Place the PHONO object program, which should be 
followed by a blank card, in the card reader hopper, 

16. Turn card reader VALIDITY off. 

17. Depress card reader RESET and START. 

18. Set command register switches to 01000300 q. 

0 

19. Press RESET and LDCR. 

20. Press START up and down. The PHONO program will 
be read into memory and gain control. 

21. The PHONO program will then prompt a user 
response by typing 

> 

on the Teletype. 

Commands can be input from the Teletype input, the paper 
tape reader, or the card reader. 

State Switch 13 (Console Switch 3) is used to control 
the trace (VOTRAX octal code) of the VOTRAX output on the 
Teletype; Console Switch 3, if up, is equivalent to TRACEON. 
State Switch (Console Switch 5) is used to control the echo 
of the input on the Teletype; Console Switch 5, if up, is 
equivalent to NOECHO. 



Restart 


The PHONO program can be restarted at any time by 
pressing STOPi setting the command register switches to 
01001OOlg and then pressing LDCR and START. The PHONO 
program assumes operation as in the state it is first started 

except it does not erase the vocabulary table and the phoneme 
code list. 


PHONO Commands and Examples 
■—These characters are used to separate 
other groups, such as words. Members of the group are all 
equivalent. The group consists of the blank space, return, 

and line feed. A blank which immediately follows a blank 
is ignored. 


arrow or underline .—Causes the last character 
typed in to be ignored, unless the last character was a 
blank. Examples* 

"UV4H2" is the same as "UH2". 

"C0NP«MPUTER" is the same as "COMPUTER". 

Digit, 1 to 4.—Sets the inflection level. A question 
mark (?) is printed and field ignored if the digit is outside 
the inflection level range (1 to 4). Example* 

2 /ae /n l/s /eR 

A slash is printed immediately after the digit by 
the PHONO program. "AE" above has an higher inflection level 
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than the "S". The phoneme assumes the previous inflection 
level when no inflection level (j^phoneme) is specified. In 
this case, "N” has the same inflection level as "AE". 


Phoneme. Specifies one of the phonemes. A <iuestion 
mark (?) is printed and field ignored if the phoneme entered 
is illegal. Examplei 

"AEl" is translated to the VOTRAX code value of SI a* 

o 

^■■(Hyphen) .--Phoneme to speak a brief pause between or 
within words (Hyphens may not be used in spelling or punctu¬ 
ation), Example I 

PRESIDENT NIXON NOMINATED A NEW VICE-PRESIDENT. 


JL- (Comma).—Phoneme to pause between phrases. The comma 
pause is longer than the hyphen pause (Commas may not be used 
in spelling). Example* 

TODAY, PRESIDENT NIXON NOMINATED A NEW VICE-PRESIDENT. 


—(Period).—Phoneme to pause between sentences. The 
period pause is longer than the comma pause (Periods may not 
be used in spelling). Example* 


--..(.Do^bie quote) .—Defines vocabulary (enter a new 
source in the vocabulary table). Example * 


A response (left parenthesis *'(■’) is typed on the Tele- 
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type, and then the user can enter the vocabulary ending with 
a right parenthesis 

"(ANSWER) 

A response (colon is typed immediately following the 

at this time the user can enter the definition of the vo¬ 
cabulary ending with a carriage return or a double quote. 

The vocabulary definition can be a combination of phonemes 
and pre-defined keys. 

1. "(ANSWER). 2 /ae /n l/s /ER 

2. "(ANSWERYES)I ANSWER 2/Y1 /a 1/ay /S" 

The vocabulary can be redefined (erase the previous 
definition), which enables the user to modify the pronunci¬ 
ation of the vocabulary. 

3. "(ANSWER)I 3/AE 2 /n l/s /ER 

L. .(Left parenthesis ) .—Same as """ (double quote), 

except no first response (left parenthesis "(") is typed on 
the Teletype. 

—Quit speech. Examplei 

$Q 

"UIET" is printed immediately following "$Q". Nulls are 
sent to VOTRAX. hence the VOTRAX quits speaking. 

$S. Speak once from the phoneme code list. Examples* 

1. 2/AE /n 1/s /eR $S 

"ANSWER" is uttered once. 
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2. ANSVffiR $S 

"ANSWER" is uttered once. "ANSWER" must be pre¬ 
defined. 

3- $S 

What is in the phoneme code list is uttered once. 

$R .—Speak repeatedly from the phoneme code list. 

Similiar to "$S"; the only difference being that this command 
repeats the uttering. 

$blank .--Repeat speaking the phoneme code, then erase the 
phoneme codes list. Examplei 

2/AE /N I/S /ER $ 

"ANSWER" is uttered repeatedly. Since the phoneme code 
list has been erased, no sound would be output if a "$S" 
command were next typed in. 

$EC .—Erase the phoneme code list. 

$ES .—Erase the source list. 

$1 .—Go to program Loader. Examplei 

$L 

"OAD" is printed immediately following "$L". Control is 
transfered to the Bootstrap Loader (location lOOg), hence PHONO 
enables the user to load and execute a new program. 

|D.--Go to the program Memory Dump in order to diagnose 
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programming errors. Example* 

$D 

"UMP" is printed immediately follows The starting 

dirnip address is entered from the Teletype. The memory dump 
may be aborted at any moment by striking any key on the Tele- 
type. 


Error Messages 

Too many symbols .--If the user tries to define a vo¬ 
cabulary which has more than 20 characters, the above error 
message will be typed on the Teletype, 

Too many source words ,—When the vocabulary table is 
full, the above error message will be typed on the Teletype 
and the CPU halted at location lOOlg, 

In this case, the user can press START up and down and 
restart again. Since the vocabulary table is full, the user 
may either issue a ”$ES" command to erase the vocabulary 
table or continue without trying to define more vocabulary. 
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CHAPTER VI 


DISCUSSION 

The VOTRAX terminal can he implemented on numerous 
applications ranging from computer aided instruction systems 
to on-line student information inquiry systems (1). 

The present system is based on the available Fairchild 
F24 mini-computer in the Department of Computer Sciences, 
which has only 4k memory of 24 bits/word. The size of the 
vocabulary is limited to a few himdred words. Suppose we use 
the time-sharing computer (HP-2000) in the University Computer 
Center, which has available mass secondary storage, as a host 
computer remotely accessed by the F24; the vocabulary size 
is then virtually unlimited. The proposed new system flow 
is shown in Figure 8. 

The vocabulary file can be read into the F24 computer 
core memorys then the file transfered to host CPU memory by 
a program written in BASIC. After sorting by merging (2), the 
vocabulary file just read in will be merged into the previous 
vocabulary file, hence eliminating re-constructing the 
lexicographical (3) vocabulary file. 

The most-often used words can be kept in the F24 mini¬ 
computer for speed considerations. As for the vocabulary 
search of those stored in the host time-sharing computer 
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secondary storage, a record can be brought into the F24 
memory by retrieving on secondary keys. A binary search (3) 
is then performed and phonemes sent to VOTRAX for speech 
generation. 

By utilizing a host computer the range of applications 
would be greatly extended, particularly for spoken output to 
file inquiries. If an outside telephone could call the P24, 
connected to the host computer, then a person anywhere in the 
world could receive the answer inquiries relating to large 
data bases. 
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APPENDIX A 


VOTRAX PHONETIC CODES 


Phoneme Octal Phoneme Octal Phoneme Octal 


PAO 

003 

PAl 

A 

040 

A 1 

AEl 

057 

AH 

AW 

075 

AWl 

B 

016 

CH 

E 

054 

EH 

EH 2 

000 

EHH 

F 

035 

G 

I 

047 

11 

IE 

074 

IH 

lY 

Oil 

J 

L 

030 

M 

NG 

024 

0 

02 

064 

00 

OOH 

004 

P 

R 

053 

S 

T 

052 

TH 

U 

050 

U1 

UHl 

062 

UH 2 

V 

017 

W 

Y 1 

042 

z 

*Null 

Code 277 



076 

PA 2 

060 

006 

AE 

056 

044 

AHl 

025 

023 

AY 

04 l 

020 

D 

036 

073 

EHl 

001 

010 

ER 

072 

034 

H 

033 

013 

12 

012 

005 

lU 

066 

032 

K 

031 

0 l 4 

N 

015 

046 

01 

065 

027 

001 

026 

045 

Q 

002 

037 

SH 

021 

071 

THV 

070 

067 

UH 

063 

061 

UH 3 

043 

055 

Y 

051 

022 

ZH 

007 
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source, "low" during clock pulse. 






BIBLIOGRAPHY 


Books 

Donovan, John J,, Systems Programming. NY. MoGraw-Km 
Book Co., 1972 , 

Flanagan, J, L., Speech Analysi s Synthesize and Perception, 
Berlin-Heidleberg, Springer-Verlag, 1972. 

Knuth, Donald E., The Art of Computer Programming, Vol. 3 
(3 volumes), CiIifornIa,'Addison-Wesley Publishing Co., 


Articles 

Mi About Voice Response , Datapro 70 , September, 1974 . 

Bulletin #ETB- 5 - 0302 -IVID, Troy, Michigan, Vocal Interface 
Division of the Federal Screw Works, 1973 , 

Clements, David T., Voice Output for Student Information 
Inouii^, Costa Mesa, California, Coast Community 
College District, 1974 . 

Mcl 3 jroy,_M. D., Synthetic English Speech by Rule, Murray 
Hill, New Jersey, Bell Telephone Laborat^es, March, 


Reports 

Fairchild .F 24 systems Manual (Hardware), Vol. 3 (3 volumes), 
California, Fairchild Instrumentation, 1970 . 


4o 



4l 


Unpublished Materials 

Scott, Dan W., unpublished card loader program, Department 
of Computer Sciences, North Texas State University, 
Denton, Texas, 

Scott, Dan W,, unpublished P24 simulator on the IBM 360, 
Department of Computer Sciences, North Texas State 
University, Denton, Texas. 

Scott, Dan W., "The Fairchild F24 Computer Systems Manual," 
unpublished manual. Department of Computer Sciences, 
North Texas State University, Denton, Texas, 1973. 

Scott, Dan W., unpublished memory dump program, Department 
of Computer Sciences, North Texas State University, 
Denton, Texas. 

Scott, Dan W., unpublished one-card bootstrap loader, 
Department of Computer Sciences, North Texas State 
University, Denton, Texas. 

Smith, Russel, unpublished interface design, graduate 

student, Department of Computer Sciences, North Texas 
State University, Denton, Texas, 1975. 



